Corpus: eng_news_2019_100K

Other corpora

4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of word-N-grams for N=1...5 for the first K sentences

K # of words # of bigrams # of trigrams # of 4-grams # of 5-grams
100 93 98 99 99 99
1000 799 970 993 998 999
10000 5477 8910 9724 9886 9937
100000 27744 72727 91057 96811 98728
1000000 27745 72728 91058 96812 98729


Zipf's diagram for sentence endings


Gnuplot diagram

12926 msec needed at 2021-05-28 09:06